Non-parametric k-sample tests: Density functions vs distribution functions
نویسندگان
چکیده
In this paper we introduce some tests for the comparison of k samples based on kernel density estimators (KDE), and we develope the Double Minimum method as a new and useful procedure for the crucial problem of bandwidth selection. We study, via Monte Carlo simulations, the statistical power of the proposed tests, as well as the impact of the smoothing degree and the performance of the Double Minimum algorithm. Finally, we compare the results of the tests based on the KDE to those of the traditional k-sample tests based on empirical distribution functions (EDF), and to other tests based on the likelihood ratio introduced in the recent literature. Two main conclusions are obtained. First, the proposed bandwidth selection method attain quasi-optimal results. Second, the simulations suggest that KDE-based tests are the most powerful when the underlying populations are different in shape.
منابع مشابه
Robustness to non-normality of common tests for the many-sample location problem
Abstract. This paper studies the effect of deviating from the normal distribution assumption when considering the power of two many-sample location test procedures: ANOVA (parametric) and Kruskal-Wallis (non-parametric). Power functions for these tests under various conditions are produced using simulation, where the simulated data are produced using MacGillivray and Cannon’s [10] recently sugg...
متن کاملNonparametric Transition-Based Tests for Jump Diffusions
We develop a specification test for the transition density of a discretely sampled continuous-time jump-diffusion process, based on a comparison of a nonparametric estimate of the transition density or distribution function with their corresponding parametric counterparts assumed by the null hypothesis. As a special case, our method applies to pure diffusions. We provide a direct comparison of ...
متن کاملOptimal power flow based on gray wolf optimization algorithm using probability density functions extraction considering wind power uncertainty
In recent years, utilization of the renewable based power plants has become widespread in the power systems. One of the most widely used renewable based power plants is wind power plants. Due to the utilization of wind energy to generate electricity, wind turbines have not emitted any environmental pollution. Thus, in addition to economic benefits, utilization of these power plants is of great ...
متن کاملHeight and Crown Area Distribution of Cionura erecta Shrub lands in chaharmahal and Bakhtiari Province, Using Probability Distribution Functions
Importance of probability distribution functions in natural resource studies is increasing due to their effective roles in better understanding of vegetation structure and providing conceptual models of quantitative indices of plant species. The present study was performed to model the distribution of height and canopy area of Cionura erecta L. shrub, using probability distribution functions in...
متن کاملTopics in kernel hypothesis testing
This thesis investigates some unaddressed problems in kernel nonparametrichypothesis testing. The contributions are grouped around three main themes:Wild Bootstrap for Degenerate Kernel Tests. A wild bootstrap method for non-parametric hypothesis tests based on kernel distribution embeddings is pro-posed. This bootstrap method is used to construct provably consistent teststh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 53 شماره
صفحات -
تاریخ انتشار 2009